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ABSTRACT 

The  conditions  for  optimality  of  uniform  orthonormal  subband  coders  are  reviewed.  A  number  of  properties  of  optimal 
filter  banks  are  then  summarized.  The  case  of  nonuniform  orthonormal  filter  banks  is  also  considered,  and  it  is  shown 
that  the  well  known  connection  between  optimal  coding  gain,  energy  compaction,  and  principal  component  property 
does  not  extend  to  the  nonuniform  case. 
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1.  INTRODUCTION 

The  optimization  of  orthonormal  filter  banks^>^  for  fixed  input  statistics  has  been  of  some  interest  in  recent  years®“^®. 
Similar  to  classical  transform  coders,  the  optimality  criterion  is  usually  the  coding  gain  (so  that  the  mean  square  recon¬ 
struction  error  due  to  subband  quantization  is  minimized).  Some  of  the  advances  in  recent  literature  have  addressed  the 
theory  of  uniform  filter  banks  (i.e.,  identical  decimation  ratio  in  all  channels)  from  the  contexts  of  energy  compaction®’^, 
and  principal  component  analysis  Numerical  optimization  of  the  coding  gain  has  also  been  addressed*^’^^’^^’^®.  A 
set  of  necessary  and  sufficient  conditions  for  optimality  was  developed  in^®,  and  it  was  shown  that  these  conditions 
can  be  satisfied  by  the  design  of  a  sequence  of  compaction  filters.  The  uniform  orthonormal  filter  bank  optimized  for 
coding  gain  is  also  the  prinicipal  component  filter  bank,  with  each  filter  acting  as  an  energy  compaction  filter  for  an 
appropriate  power  spectrum  related  to  the  input.  In  this  tutorial  we  first  review  these  results,  and  then  consider  the 
case  of  nonuniform  orthonormal  filter  banks.  For  this  case  we  will  see  that  there  is  no  simple  connection  between  coding 
gain,  energy  compaction,  and  the  principal  component  property. 

Preliminaries 

The  subband  coder  shown  in  Fig.  1(a)  is  said  to  maximally  decimated  if  the  decimation  ratios  satisfy  53^  l/n^  =  1. 

It  is  a  uniform  subband  coder  if  the  Uk  —  M  for  all  k.  All  discussions  except  those  in  Sec.  4  are  restricted  to  this  case. 

In  this  case  we  have  the  polyphase  representation^  shown  in  Fig.  1(b).  The  filter  bank  is  said  to  be  a  biorthogonal  or 

perfect  reconstruction  filter  bank  if  R(2:)E(2)  =  I.  It  is  said  to  be  orthonormal  or  paraunitary  if  is  unitary  for 

all  u;.  In  the  orthonormal  case  the  perfect  reconstruction  condition  is  =  /^/(e-^*^).  In  terms  of  the  filters  we  can 

express  orthonormality  as^  =  6{k-m)  where  the  notation  F{e^'^)\iM  denotes  the  Fourier  transform 

[M 

of  f{Mn).  The  orthonormality  condition  implies  in  particular  that  each  filter  Hi{e^'^)  satisfies  the  Nyquist(M)  constraint 

=  I*  Traditional  contiguous-stackings  of  brickwall  filters  (Fig.  2)  serve  as  examples  of  orthonormal  filter 
IM 

banks.  In  these  two  examples  each  filter  is  an  aliasfree(M)  filter.  An  aliasfree(M)  or  antialia$(M)  filter  is  defined  to  be 
one  whose  output  can  be  decimated  without  aliasing,  that  is,  the  shifted  versions  overlap  for 

distinct  kinO<k<M  —  l.  Equivalently  we  say  that  the  filters  have  aliasfree(M)  supports.  Such  a  support  could  have 
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multiple  number  of  passbands  (e.g.,  see  Fig.  7  later.) 


Statistical  Model,  The  input  x{n)  is  assumed  to  be  zero-mean  wide  sense  stationary  (WSS)  with  power  spectral 
density  (psd)  The  subband  signals  Vi{n)  (see  Fig.  1(a))  are  therefore  (zero-mean  and)  jointly  WSS,  and  the 

variances  of  Xi{n)  and  Vi{n)  are  the  same,  that  is,  =  a^..  We  model  the  quantizers  with  additive  noise  sources 
qi{n)  (Fig.  1(a)),  and  assume  these  noise  sources  to  be  jointly  WSS  with  zero  mean  and  variances  of  the  form^’^^ 

where  bi  is  the  number  of  bits  assigned  to  the  ith  subband  quantizer.  The  constant  c  (which 
depends  on  the  nature  of  the  pdf  of  the  quantizer  input^^  is  assumed  to  be  the  same  for  all  subbands.  This  model  does 
not  require  that  each  qi{n)  be  white  or  that  any  two  noise  sources  be  uncorrelated. 


Coding  Gain,  The  quantity  h  =  ^  bi/M,  which  is  the  average  bit  rate,  is  assumed  to  be  fixed.  The  coding  gain 

of  a  subband  coder  is  defined  by  comparing  the  average  mean  square  value  Ssbc  of  the  reconstruction  error  x{n)  —x{n) 
with  the  m.s.  value  Edirect  of  the  direct  quantization  error  (roundoff  quantizer)  with  the  same  bit  rate  b.  Using  the 
above  noise  model,  the  coding  gain  Gsbc{M)  of  the  uniform  orthonormal  subband  coder  of  Fig.  1(a)  can  be  derived^: 


/T^/r\  ^  ^direct  52i=0 

Gsbc(M)  = -  ^ 


^SBC 


l/M 


(1) 


This  expression  assumes  optiinal  bit  allocation  as  described  in^’^®.  Here  we  have  used  the  result  ^  valid 

for  uniform  orthonormal  filter  banks.  The  preceding  coding  gain  is  the  ratio  of  the  arithmetic  and  geometric  means, 
AM/GM  ratio,  of  the  subband  variances  a^..  For  fixed  input  psd  the  variances  depend  only  on  the  analysis 

filters  We  say  that  the  subband  coder  is  optimal  (for  the  fixed  M  and  5xx(e-^^)),  if  these  filters  are  such  that 

the  coding  gain  is  maximized.  From  (1)  we  see  that  coding  gain  optimization  is  equivalent  to  minimizing  the  product  of 
subband  variances.  If  the  psd  Sxx{^^^)  is  monotone  decreasing  then  the  contiguous  stacking  (Fig.  2)  yields  an  optimal 
orthonormal  filter  bank.  For  arbitrary  psd,  each  filter  in  the  optimal  system  can  have  many  passbands^^. 


2.  OPTIMALITY  OF  UNIFORM  ORTHONORMAL  FILTER  BANKS 

We  now  present  the  conditions  for  the  optimality  of  an  uniform  orthonormal  filter  bank. 

Total  Decorrelation  Of  Decimated  Subbands 

In  orthogonal  transform  coding  theory  (where  E(z)  in  Fig.  1(b)  is  a  constant  unitary  matrix),  decorrelation  of  the 
decimated  subband  random  variables  Vi{n)  is  necessary  and  sufficient  for  optimality.  That  is,  the  coding  gain  is  maximum 
if  and  only  if  E[vi{n)v^{n)]  =  0  for  z  ^  A:,  and  for  all  n.  For  orthonormal  subband  coders,  a  stronger  condition  is  necessary: 
namely,  the  decimated  subhand  mndom  processes  should  be  uncorrelated,  that  is. 


(2) 


for  i  ^  k,  and  for  all  n,  m.  This  condition  will  also  be  referred  to  as  total  decorrelation  of  subbands.  Equivalently,  the 
power  spectrum  matrix  of  the  vector  process  v(n)=  [z;o(?^)  t^i(^)  •  •  •  vm-iM  must  be  diagonal.  That  is,  letting 

Si{e^'^)  be  the  psd  of  Vt(n),  we  should  have  Svv(e'^‘^)  =  diag  [iS'o(e-^‘^)  5'i(e^‘^)  . . .  5M-i(e-^‘^)  ] .  Even  this  stronger 

condition  is  not  sufficient  for  optimality.  For  example  the  traditional  brickwall  subband  coder  in  Fig.  2  decorrelates  the 
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subband  processes  trivially  for  any  input  process.  The  preceding  necessary  condition  follows  from  the  fact  that  if  a  pair 
of  decimated  subband  processes,  say  t;o(.)  and  are  not  uncorrelated,  then  we  can  insert  a  delay  z~^  and  a  unitary 
matrix  0  to  transform  the  pair  t;o(n),vi(n-  A:)  into  an  uncorrelated  pair  «;o(^)  and  wi{n)  (Fig.  3).  The  modified  filter 
bank  continues  to  be  orthonormal.  It  can  be  shown  that  this  modification  is  such  that  which  shows 

that  the  coding  gain  has  been  increased.  Thus,  if  total  decorrelation  is  not  true,  the  coding  gain  can  be  increased.  Note 
that  total  decorrelation  of  the  undecimated  subbands  Xfc(n)  is  not  necessary. 

Using  the  preceding  necessary  condition,  we  can  show  that  the  optimal  uniform  orthonormal  subband  coder  has 
Gsbc{M)  =  1  if  and  only  if  the  (zero-mean  WSS)  input  process  x{n)  has  power  spectral  density  of  the  form  Sxxi^^"^)  ~ 
Yu  this  case  all  the  decimated  subband  signals  Vi{n)  have  identical  psd  5(e*^^).  The  condition  Sxx{^^^)  = 
5(giu;M)  jg  equivalent  to  the  condition  that  the  autocorrelation  Rxx{k)  of  x{n)  is  nonzero  only  when  fc  is  a  multiple  of 
M.  Thus,  there  exist  nonwhite  inputs  for  which  orthonormal  SBC  might  yield  no  gain,  for  some  fixed  M.  However, 
unless  the  input  is  white,  this  will  not  happen  for  all  values  of  M. 

Majorization  of  Decimated  Subbands 

We  say  that  the  set  of  decimated  subband  signals  Vk{n)  has  the  majorization  property  if  their  power  spectra  {5'fc(e^‘^)} 
satisfy 

SQ{e^^)  >  >  . . .  >  5M-i(e^'^),  for  all  a;.  (3) 

That  is,  the  kth  subband  psd  dominates  the  {k  H-  l)th  psd  for  all  a;.  This  is  demonstrated  in  Fig.  4. 

Theorem  !•  Majorization  is  necessary.  For  fixed  input  psd  Sxxie^"^)  and  fixed  M,  suppose  a  subband  coder  is 
optimal  (in  the  coding  gain  sense)  among  the  class  of  all  M-band  uniform  orthonormal  subband  coders.  Then  the 
decimated  subband  signals  Vk{n)  have  the  majorization  property.  0 

Proof,  Assume  majorization  is  not  satisfied,  e.g.,  let  So{e^^)  >  Sx{e^'^)  be  not  valid  for  all  a;,  even  though  . 

Cascade  a  new  paraunitary  matrix  T(e^’‘^)  to  the  right  of  the  given  filter  bank  as  shown  in  Fig.  5,  where 

1m  if5o(e>)>5i(e^-) 

T(e^‘^)=<  [010]  (4) 

1  0  0  if  5o(e^’^)  <51(6-?^^) 

^  0  0  Ia/-2_ 

The  new  pair  of  power  spectra  SQ{e^'^),  S[{e^'^)  will  then  satisfy  the  property  for  all  a;.  Moreover  for 

each  a;,  50(6^^*^)  >  5o(e-^'^)  whereas  <  Si{e^'^).  Thus  the  variances  of  the  new  signals  i£;o(^)  and  tyi(n)  are  such 

that  and  But  -h  (since  T{e^^)  is  paraunitary).  Using  this  we  can  verify 

^wo^wi  <  coding  gain  can  been  increased  whenever  majorization  is  not  satisfied.  V  VV 

Necessary  and  Sufficient  Condition  For  Optimality 

Though  majorization  and  decorrelation  are  necessary  for  optimality,  neither  of  them  is  individually  sufficient.  For 
example,  the  brickwall  subband  coder  with  contiguous  stacking  (e.g.,  Fig.  2)  satisfies  the  total  decorrelation  property 
for  any  input  psd.  On  the  other  hand  the  orthonormal  system  with  Hk{z)  =  z~^  satisfies  majorization  for  any  input 
though  it  yields  no  coding  gain.  However  we  have^®: 
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Theorem  2.  A  necessary  and  sufficient  condition  for  optimality.  For  fixed  input  psd  Sxx{g^^),  the  filter  bank 
has  maximum  coding  gain  (among  all  M  band  uniform  orthonormal  subband  coders)  if  and  only  if  the  decimated 
subband  signals  simultaneously  satisfy  total  decorrelation  and  majorization.  Furthermore,  when  these  conditions 

are  satisfied,  the  set  of  power  spectra  {iS'fc(e^'^)}  of  the  decimated  subband  signals  is  unique.  0 

Proof.  From  Fig.  1(b)  we  have  Svv(e-^‘^)  =  E(e^‘*')SB(e^‘^)Et(e-^*‘^)  where  SB(e*^‘*')  and  Sw(e^'*')  are  power  spectra 
of  the  vectors  XB(n)  and  v(n)  indicated  in  the  figure.  If  E(e^‘^)  performs  total  decorrelation,  Sw(e^‘^)  is  diagonal: 

Sw(e'‘")  =  E(c^“)SB(eJ-)Et (e^“)  =:  diag  {So{e^n  S,{en  . . .  SM-iie^n)  (5) 


Since  E(e*^‘*')  is  unitary,  this  implies  that  for  each  fixed  a;,  the  subband  power  spectra  ^^(e-^^)  are  eigenvalues  of  SB(e-^‘^). 
Suppose  the  majorization  property  also  holds.  This  means  that  for  each  a;  these  eigenvalues  are  ordered  in  a  decreasing 
fashion.  Since  the  set  of  eigenvalues  is  unique,  each  diagonal  element  in  (5)  is  uniquely  determined  for  each  a;.  Thus 
the  set  of  power  spectra  {iSjfe(e*^‘*^)}  which  has  the  majorization  property  is  unique  as  claimed  in  the  theorem.  Since 
majorization  and  total  decorrelation  are  necessary  for  optimality,  and  since  there  is  only  one  set  of  majorized  decorrelated 
subband  power  spectra,  it  follows  that  majorization  together  with  decorrelation  leads  to  optimality.  VV  V 

Since  the  diagonalizing  eigenvector  matrix  may  not  be  unique,  the  analysis  filters  of  the  optimal  system  may  not 
be  unique. 


3.  COMPACTION  FILTERS  AND  CODING  GAIN  MAXIMIZATION 

Fig.  6  shows  a  filter  with  a  zero- mean  WSS  input  x(n)  having  psd  This  can  be  regarded  as  an  M-fold 

decimation  filter,  that  is,  one  branch  of  an  M-channel  analysis  bank.  Consider  the  problem  of  designing  such 

that  the  output  variance  ay  is  maximized  subject  to  the  constraint  that  be  Nyquist(M),  that  is. 


A/-1 


=1,  i.e.,  27r/:/M)^|2  _  for  all  a;  (Nyquist  constraint). 

\,M 


(6) 


The  solution  H{e^^)  is  an  optimum  com  paction  (M)  filter.  The  ratio  a^/a^  is  called  the  compaction  gain.  The  Nyquist 
constraint  is  imposed  because  |i/fc(e-^^)P  is  Nyquist(M)  for  every  analysis  filter  in  any  uniform  orthonormal  filter 

bank.  This  constraint  implies  the  unit-energy  property  |i/(e^‘^)l^du;/27r  =  1  as  well  as  the  boundedness  property 
<  M.  The  role  of  the  energy  compaction  concept  in  subband  coding  theory  has  been  observed  by  a  number 
of  authors®"^"*.  The  following  is  a  refined  version  for  arbitrary  M,  of  Unser’s  construction  of  compaction  filters®:  (a) 
For  each  frequency  a;o  in  0  <  a;  <  27r/M  define  the  M  alias  frequencies  u>k  =  oJo-\~  27:kfM,  where  0  <  A:  <  M  —  1.  (b) 
Compare  the  values  of  5xx(e-^‘^)  at  these  M  alias  frequencies  {u;^}.  Let  L  be  the  smallest  integer  such  that  5xx(e-^‘^^)  is 
a  maximum  in  this  set.  Then  assign 


j^(g.?(t-^o+(2fe7r/A^)))  —  I  y/M 


when  k  =  L 
otherwise. 


(7) 


Repeating  this  for  each  ljq  in  the  region  0  <  a;  <  27r/M,  the  filter  //(e^"*")  is  completely  defined  for  all  a;  in  0  <  u;  <  27r, 
This  filter  maximizes  the  output  variance  a^  under  the  Nyquist(M)  constraint  (6). 
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Remarks.  If  is  an  optimal  compaction(M)  filter  for  an  input  psd  then  it  will  be  a  valid  optimal 

solution  for  the  modified  psd  where  /[.]  >  0  is  any  nondecreasing  function.  If  a  psd  is  non  increasing 

in  [0,27r),  then  the  optimum  compaction  filter  is  lowpass.  While  the  optimal  compaction  filter  is  not  unique,  the 
construction  described  above  has  the  following  special  properties:  (a)  is  an  ideal  two-level  filter  with  passband 

response  =  y/M  and  stopband  response  =  0.  (b)  is  an  antialias(M)  filter.  This  follows  because  if  ^  0  for 

some  w,  then  =  0  for  1  <  A:  <  M  —  1.  (c)  The  total  width  of  all  passbands  is 

To  describe  the  construction  of  optimum  (uniform  orthonormal)  filter  banks,  consider  the  example  of  input  psd 
shown  in  Fig.  7(a),  and  let  M  =  4.  The  first  step  is  to  choose  one  filter,  to  be  an  optimal  energy  compaction 

filter  for  the  input  psd  Sxx{^^'^)>  This  filter  is  shown  in  Fig.  7(b).  Let  the  passband  support  of  be  denoted  5o. 

Suppose  we  define  a  new  psd 

si^Jien  = 

as  shown  in  Fig.  7(c).  Thus  is  obtained  by  peeling  off  the  portion  of  Sxx{^^’^)  falling  in  the  passband  of 

We  design  the  next  analysis  filter  /7i(e^‘^)  to  be  the  optimal  compaction  filter  for  this  partial  psd  5xx  (e^^). 
This  is  shown  in  Fig.  7(d).  Define  the  next  partial  psd  by  peeling  off  the  portions  of  Sxx{c^^)  in  the  passbands 

of  Hoie^"^)  and  i7i(e-^‘^),  and  continue  in  this  manner.  Thus  all  the  analysis  filters  can  be  identified  (part  (e)  in  the 
figure).  The  proof  that  this  construction  results  in  an  optimal  orthonormal  system  can  be  found  in^^.  Most  readers  will 
recognize  that  the  preceding  construction  yields  a  principal  component  filter  bank^^.  This  connection  between  coding 
gain,  principal  component  property,  and  compaction  gain  does  not  carry  over  to  the  case  of  nonuniform  filter  banks 
(Sec.  4). 

In  general  the  optimum  compaction  filters  are  ideal  infinite  order  filters.  In  practice,  if  we  approximate  these  with 
FIR  filters  rather  than  approximating  Fig.  2  as  done  traditionally,  the  coding  gain  can  be  significantly  improved.  It  can 
be  shown  that  the  maximum  compaction  gain  is  unity  for  an  input  psd  Sxxi^^"^)  (for  a  fixed  value  of  M)  if  and  only  if 
it  has  the  form  Sxxi^^"^)  =  In  fact,  the  following  three  statements  are  equivalents^:  (a)  Sxx{^^^)  has  the  form 

(b)  Maximum  compaction  gain  Omax{M)  =  1.  (c)  Any  M-band  orthonormal  SBC  yields  coding 

gain  =  1. 


r  0  in  5o 

1 5xx(e^^)  otherwise 


4.  NONUNIFORM  FILTER  BANKS 

The  nonuniform  filter  bank  of  Fig.  1(a)  is  orthonormal^^  if  the  analysis  filters  satisfy  =  6{k  —  m) 

•  idkm 

where  gkm  —  9c<l{nk,nm}'  The  coding  gain  of  the  nonuniform  M-band  orthonormal  filter  bank  has  been  derived  in^® 
and  is  given  by 

V/-  <’> 

Ui=0 

Consider  the  special  case  of  a  two-level  tree  structured  filter  bank  (Fig.  8),  which  is  equivalent  to  a  3-channel  nonuniform 
FB.  The  coding  gains  Gi  and  G2  of  the  two-channel  filter  banks  at  the  levels  1  and  2  are  given  by  Gi  =  ^x! \J^yi<^x2 
and  G2  =  (Ty^  /  Thus  the  coding  gain  of  the  three  channel  filter  bank  is 

GsBci^) 


GiGl^^ 
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More  generally,  consider  the  M  channel  dyadic  tree  structure  with  M  —  1  levels  (Fig.  9(a)).  Assuming  each  two  channel 
filter  bank  is  orthonormal,  we  have  an  M-band  nonuniform  orthonormal  filter  bank,  and  the  coding  gain  is^® 

Gsbc{M)  ^  . .  (10) 


where  Gm  is  the  coding  gain  at  level  m.  Thus  the  extra  benefit  offered  by  the  mth  split  decays  exponentially  with  m. 
Another  consequence  of  (10)  is  that,  if  the  tree  structure  maximizes  the  coding  gain  for  a  given  input  then  the  right- 
flushed  subtrees  indicated  in  Fig,  10  are  optimal  for  their  respective  inputs.  However,  since  these  inputs  depend  on  the 
filters  preceding  them,  this  observation  is  not  directly  useful  to  identify  the  optimal  filter  bank. 


Coding  Gain  and  Compaction  Gain 

Consider  the  input  power  spectrum  shown  in  Fig.  11(a).  Assuming  that  the  process  is  Gaussian,  the  rate-distortion 
theoretic  upper  bound  on  the  coding  gain,  given  by^^  Gup  =  <^1!  exp  Ln  5a;x(e-^‘^)^i*;/27r,  has  the  value 

_  2  +  c  +  d 

4(c(i)l/4  ^  ^ 

Suppose  we  use  the  two-level  orthonormal  tree  structured  filter  bank  shown  in  Fig.  8,  with  filter  responses  as  in  Fig. 

11(b)  and  11(c).  The  coding  gains  of  the  individual  levels  are 

2 -f"  c  +  d  c-\- d 

= - - -  ,  Cr2  =  777=^ 

A  c±d  2ycd 

2 

The  total  coding  gain  of  the  tree,  given  by  is  therefore  equal  to  (11)  showing  that  the  tree  achieves  the  upper 

bound  on  the  coding  gain.  This  means  in  particular  that  the  choice  of  filters  shown  in  Fig.  11  results  in  an  optimal 
orthonormal  two-level  dyadic  tree.  Next,  Fig.  11(d)  shows  the  effective  filter  Hiq{z)H2o{z^)  of  the  top  channel  (which 
has  effective  decimator  [  4)  of  the  three  channel  filter  bank.  Since  c  <  1,  this  is  clearly  not  an  optimum  compaction(4) 
filter  for  the  input  psd.  This  example  shows  that,  even  though  the  coding  gain  is  optimized,  the  top  filter  Hio{z)H2o{z‘^) 
is  not  an  optimum  compaction  filter  for  the  input  x(n). 


Nonoptimality  of  Individual  Sections 

We  pointed  out  that  if  a  tree  structured  filter  bank  is  optimal,  then  the  right-flushed  subtrees  (Fig.  10)  are  optimal  for 
their  inputs.  However,  the  left-flushed  subtrees  are  not  necessarily  optimal.  For  example,  the  gain  Gi  of  level  1  may 
not  be  optimal  for  the  primary  input  x(n).  In  fact,  if  we  optimize  each  stage  for  its  input  by  proceeding  from  left  to 
right,  it  may  not  yield  an  optimum  tree.  This  is  demonstrated  with  the  example  shown  in  Fig.  12,  for  a  two-level  dyadic 
tree.  The  choice  of  filters  shown  earlier  in  Fig.  11  yields  the  coding  gain  (11),  and  therefore  represents  an  optimal 
orthonormal  filter  bank.  Instead  of  using  this  filter  bank,  suppose  we  first  optimize  the  coding  gain  Gi  of  level  1.  The 
unique  ideal  level-1  filter  bank  for  this  is  shown  in  Fig.  12(b),  and  has  coding  gain  Gi  =  (2  +  c  -h  d)/2y/{l  -h  c)(l  -|-  d). 
The  decimated  subband  psd  at  level  1  are  shown  in  Fig.  12(c),  and  12(b).  The  optimum  level  2  filter  bank  yields  the 
coding  gain  G2  =  (1  4-  d)/2y/d  (assuming  this  is  greater  than  (1  +  c)/2«v/c)  so  that  the  overal  gain  is 


GiGl^^  = 


2  c  d 


X 


/  2v/c  \  1/2 

Vl  +  c/ 

> - ' 

a 
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which  is  the  optimal  gain  (11)  reduced  by  the  factor  a.  Since  a  <  1  unless  c  =  1,  this  filter  bank  is  not  optimal. 

Principal  Component  Property  And  Compaction 

As  shown  earlier,  unlike  in  uniform  filter  banks,  the  coding  gain  and  compaction  gain  are  not  directly  related  in  the  case 
of  nonuniform  orthonormal  filter  banks.  However,  there  is  a  simple  direct  relation  between  optimal  compaction  and  the 
principal  component  property  even  in  the  nonuniform  case. 

In  our  discussion  we  find  it  convenient  to  change  the  normalization  convention  for  the  filters.  Thus  consider  the 
example  of  a  3-level  tree  of  the  form  Fig.  9.  Assuming  that  the  filter  bank  is  orthonormal  we  have  +2<Txj  +4^^^  = 

8(7|.  We  would  like  to  rescale  the  analysis  filters  such  that  under  the  new  scaling  convention  «^X0  +<^Xi  +‘^X2 
or  more  generally,  ‘^xk  =  <^x-  This  is  accomplished  simply  by  dividing  each  two-channel  analysis  filter  Hkm{z)  by 

y/2  and  multiplying  the  corresponding  synthesis  filter  by  \/2.  Under  the  new  scaling  convention,  the  signals  y*,  indicated 
in  Fig.  9  have  variances  given  by 

=  ^lo  + 

If  the  tree  structured  filter  bank  has  the  principal  component  property  for  the  given  input  x(n),  then  the  partial  sum 

fc=0 

is  maximized  for  each  value  ofLinO<L<M-l.  That  is,  the  left-flushed  subtrees  indicated  in  Fig.  13  should  be 
such  that  their  top  filters 

are  optimal  compaction(2*‘''^)  filters  for  the  primary  input  x(n).  For  example,  Hio{z)  should  be  the  optimum  com- 
paction(2)  filter  for  x(n),  whereas  Hio{z)H2a{z^)  must  be  the  optimum  compaction(4)  filter  for  the  same  x(n)  and  so 
forth. 

To  explain  how  the  above  principal  component  property  can  be  satisfied,  consider  the  example  of  a  three-level 
tree  of  the  form  shown  in  Fig.  9.  First  design  an  optimum  compaction(8)  filter  for  the  input  x{n).  As  shown  in^®, 
this  filter  can  always  be  implemented  in  the  multirate-cascade  form  shown  in  Fig.  14  where  each  Hko{z)  is  an  optimal 
compaction(2)  filter  for  its  input.  Since  each  Hko{z)  is  such  that  \Hko{e^‘^)\‘^  is  Nyquist(2),  we  can  always  define  a  filter 
Hkiiz)  such  that  the  pair  {Hko{z),  Hki{z)}  is  a  two-channel  orthonormal  filter  bank.  Since  each  Hko{z)  is  an  optimum 
compaction  filter  for  its  input,  the  pair  {Hka{z),Hki{z)}  maximizes  the  coding  gain  for  its  input.  In  this  way,  the 
complete  tree  structure  is  defined  (by  Fig.  9)  and  satisfies  the  principal  component  property.  The  coding  gain  of  this 
prinicipal  component  filter  bank,  however,  is  not  necessarily  maximized  as  explained  earlier. 

5.  CONCLUDING  REMARKS 

We  now  summarize  a  number  of  related  results  derived  in^’^.  The  compaction  gain  in  Fig.  6  depends  on  the 

input  psd  as  well  as  the  filter  H'(e’").  The  maximum  compaction  gain  Gmax{M)  is  such  that  1  <  Gmax{M)  <  M.  We 
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have  Gmax{M)  =  1  if  and  only  if  Sxx(e-’")  =  and  Gmox(M)  =  M  if  and  only  if  has  an  aliasfree(M) 

support.  For  arbitrary  Sxx(e-’“),  the  gain  Gmax{M)  is  not  monotone  increasing  in  M,  though  this  can  be  shown  to  be 
true  for  monotone  Sxx(e-’^).  Finally,  the  optimized  coding  gain  Gsbc{M)  of  the  M  band  uniform  orthonormal  subband 
coder  is  not  necessarily  a  monotone  increasing  function  of  M  even  if  S*x(e-'“)  is  monotone.  This  is  unlike  the  behavior 
of  the  optimal  transform  coder  (KLT). 

For  fixed  number  of  subbands  M,  biorthogonal  filter  banks  can  in  general  provide  better  coding  gain  than  orthonor¬ 
mal  filter  banks.  In  fact,  the  coding  gain  of  an  optimal  orthonormal  filter  bank  can  almost  always  be  improved  by 
using  a  half- whitening  prefilter  and  a  post  filter  around  that  filter  bank.  The  optimal  combination  of 

the  prefilter  and  the  orthonormal  filter  bank,  which  maximizes  coding  gain,  is  given  in^®.  A  related  question 

of  interest  is,  what  is  the  class  of  input  psd  Sxx(e'’“)  for  which  an  optimal  orthonormal  filter  bank  is  as  good  as  a 
biorthogonal  one  (in  the  coding  gain  sense)?  For  uniform  filter  banks  with  fixed  M  it  is  shown  in^’^  that  this  is  the  case 
if  and  only  if  the  frequency  axis  [0,27r)  can  be  partitioned  into  M  aliasfree(M)  regions  such  that  5xx(e-^“)  is  a  constant 
in  each  region. 
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bank  V(n)  bank 


Fig.  1 .  (a)  The  maximally  decimated  subband  coder  schematic,  and 
(b)  the  polyphase  representation  in  the  uniform  case. 


Fig.  2.  The  brickwall  filter  bank  with  contiguous 
stacking,  (a)  One-sided  stacking,  and  (b)  two- 
sided  stacking  as  in  the  real-coefficient  case. 


Fig.  8.  A  two-level  dyadic  tree 
structured  filter  bank. 
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Fig.  10.  The  right-flushed  subtrees  of  an  optimal 
orthonormal  FB  are  optimal  for  their  inputs. 


Fig,  1 1.  An  input  psd  and  the  various  filters  in  a  two  level 
tree  structured  optimal  orthonormal  filter  bank 
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Fig.  12  (a)  An  input  psd,  (b)  the  optimal  level  1  filter,  and 
(c),  (d)  the  decimated  subband  power  spectra  at  level  1. 


Examples  of 


Fig.  14.  The  cascade  implementation  of  an  optimal 
compaction{8)  filter.  This  completely  defines 
the  3-level  dyadic  principal  component  filter  bank. 


Fig.  13.  The  left-flushed  subtrees  of  a  principal 
component  FB  define  several  optimal  compaction 
filters  for  the  primary  input  x(n). 


